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Abstract 

We pose a new and intriguing question motivated by distributed computing regarding random walks 
on graphs: How long does it take for several independent random walks, starting from the same vertex, 
to cover an entire graph? We study the cover time - the expected time required to visit every node in a 
graph at least once - and we show that for a large collection of interesting graphs, running many random 
walks in parallel yields a speed-up in the cover time that is linear in the number of parallel walks. We 
demonstrate that an exponential speed-up is sometimes possible, but that some natural graphs allow 
only a logarithmic speed-up. A problem related to ours (in which the walks start from some probablistic 
distribution on vertices) was previously studied in the context of space efficient algorithms for undirected 
s-i-connectivity and our results yield, in certain cases, an improvement upon some of the earlier bounds. 
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1 Introduction 



Consider the problem of hunting or tracking on a graph. The prey begins at one node, the hunters begin 
at other nodes, and in every step each player can traverse an edge of the graph. The goal is for the hunters 
to locate and track the prey as quickly as possible. What is the best algorithm for the hunters to explore 
the graph and find the prey? The answer depends on many factors, such as the nature of the graph, 
whether the graph can change dynamically, how much is known about the graph, and how well the hunters 
can communicate and coordinate their actions. Graph exploration problems such as this are particularly 
interesting in changing or unknown environments. In such environments, randomized algorithms are at an 
advantage, since they typically require no knowledge of the graph topology. 

Random walks are a natural and thoroughly studied approach to randomized graph exploration. A 
simple random walk is a stochastic process that starts at one node of a graph, and at each step moves from 
the current node to an adjacent node chosen randomly and uniformly from the neighbors of the current 
node. A natural example of a random walk in a communication network arises when messages are sent 
at random from device to device. Since such algorithms exhibit locality, simplicity, low-overhead, and 
robustness to changes in the graph structure, applications based on random walks are becoming more and 
more popular. In recent years, random walks have been proposed in the context of querying, searching, 
routing, and self-stabilization in wireless ad-hoc networks, peer-to-peer networks, and other distributed 
systems and applications [El EH H21 E01 El EH [H [TO] . 

The problem with random walks, however, is latency. In the case of a ring, for example, a random 
walk requires an expected G(n 2 ) steps to traverse a ring, whereas a simple traversal requires only n steps. 
The time required by a random walk to traverse a graph, i.e., the time to cover the graph, is an important 
measure of the efficiency of random walks: The cover time of a graph is the expected time taken by a 
random walk to visit every node of the graph at least once [5j [2]. The cover time is relevant to a wide 
range of algorithmic applications [211 [32| [23| [8], and methods of bounding the cover time of graphs have 
been thoroughly investigated [281 El (El 031 Ell EI]- Several bounds on the cover time of particular classes 
of graphs have been obtained, with many positive results [El [131 EU ESI [16] . 

The contribution of this paper is proposing and partially answering the following question: Can multiple 
random walks search a graph faster than a single random walk? What is the cover time for a graph if we 
choose a node in the graph and run k random walks simultaneously from that node, where now the cover 
time is the expected time until each node has been visited at least once by at least one random walk? 

The answer is far from obvious. Consider, for example, running k random walks simultaneously on a 
ring. If we start all k random walks at the same node, then the random walks have little choice but to 
follow each other around the ring, and it is simply a race to see which of them completes the trip first. 
We prove in Section El that on a ring the cover time for k random walks is only a factor of 0(log£;) faster 
than the cover time for a single random walk. On the other hand, there are graphs for which k random 
walks can yield a surprising speed-up. Consider a "barbell" consisting of two cliques of size n joined by 
a simple path (see Figure Q] in Section [7]). The cover time of such a graph is 0(n 2 ) and its maximum is 
obtained when starting the walk from the central point of the path. In this graph, the bells on each end of 
the barbell act as a sink from which it is difficult for a single walk to escape, but if a logarithmic number 
of random walks start at the center of the barbell, each bell is likely to attract at least one random walk, 
which will cover that part of the graph. We prove in Section [7] that if we run k = O(logn) random walks 
in parallel, starting from the center, then the cover time decreases by a factor of n from 0(n 2 ) to O(n), 
which corresponds to a speed-up exponential in k. 

The main result of this paper — summarized in Table Q] — is that, in spite of these examples, a linear 
speed-up is possible for almost all interesting graphs as long as k is not too big. In Section IH we prove 
that if there is a large gap between the cover time and the hitting time of a graph, where hitting time 
is the expected time for a random walk to move from u to v for any two nodes u and v in the graph, 
then k random walks cover the graph k times faster than a single random walk for k sufficiently small (see 
theorems E] and E])- Graphs that fall into this class include complete graphs, expanders, d-dimensional grids 
and hypercubes, <i-regular balanced trees, and several types of random graphs. In the important special 
case of expanders, we can actually prove a linear speed-up for k < n and not just k < logn. While we 
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Table 1: Results summary (for any constant e > 0) 



Graph family name 


Cover time 
C 


Hitting time 

fomax 


Mixing time 

trn 


Speed up 
lower bound 


(order) 
upper bound 


cycle 


n 2 /2 


n 2 /2 


0{n 2 ) 


log(fc) 


log(fc) 


2-dimensional grid 


Q(n log 2 n) 


Q(n logn) 


0(n) 


k,k < 0(log i_e n) 




d-dimensional grid, d > 2 


O(nlogn) 


0(n) 


<d{n 2 ' d ) 


fc, fc < 0(log i_e ra) 




hypercube 


O(nlogn) 


0(n) 


log n log log n 


fc, fc < 0(log 1_e n) 




complete graph 


G(nlogn) 


0(n) 


1 


k, k < n 


fc, k < n 


expanders 


<d(n\og n) 


6(n) 


logn 


fi(fc), fc < n 




E-R Random graph 


<d(n\ogn) 


0(n) 


logn 


fc, fc < 0(log i_e n) 





demonstrate a relationship between the cover time and the hitting time, we also demonstrate a relationship 
between the cover time and the mixing time (see Theorem [9]), which leads us to wonder whether there is 
some other property of a graph that characterizes the speed-up achieved by multiple random walks more 
crisply than hitting and mixing times. 

1.1 Related work 

A related problem was previously studied in the context of algorithms for solving undirected s-t connec- 
tivity, the problem of deciding whether two given vertices s and t are connected in an undirected graph. 
The key step in many of these algorithms is to identify large subsets of connected vertices and to shrink 
the graph accordingly. The algorithms use short random or pseudorandom walks to identify such subsets. 
These walks are either starting from all the vertices of G or from a suitably chosen sample of its vertices. 
Deterministic algorithms concerned with the amount of used space [29 \ [7] use pseudorandom walks started 
from all the vertices of G. Parallel randomized algorithms, e.g., |26|, [22], use short random walks from 
each vertex of G. Although there seems to be a deeper connection to our problem, these techniques do 
not seem to provide any results directly related to our question of interest. 

However, a problem closer to ours is considered in a sequence of papers on time-space trade-offs for 
solving s-t-connectivity |14[ \TT\ [20] . Algorithms in this area choose first a random set of representatives 
and then perform short random walks to discover connectivity between the representatives. A part of the 
analysis in [H] by Broder et al. is calculating the expected number of steps needed to cover the whole 
graph. Indeed, Broder et al. state as one of their main results that the expected number of steps taken 
by k random walks starting from k vertices chosen according to the stationary distribution to cover the 

2 i 3 

whole graph is 0( m n ), where m is the number of edges and n is the number of vertices of the graph 
|14j . Barnes and Feige in [111 [20] consider different starting distributions that give a better time-space 
trade-off for the s-i-connectivity algorithm but they do not state any explicit bound on the cover time by 
k random walks. In contrast, in this work, we formulate our interest in comparison between the expected 
cover time of a single walk and of k random walks. 

Although our work focuses on covering the graph starting from a single vertex, under certain conditions 
our results yield improved bounds on the cover time starting from the stationary distribution. In particular, 
for graphs with fast mixing time, Lemma [191 yields the bound 0((nlogn)/k) on the cover time of k random 
walks starting from the stationary distribution on an expander and the proof of Theorem [9] gives bound 
0((nt m log 2 n)/k) on the cover time of k random walks starting from the stationary distribution on graphs 
with mixing time t m . Indeed, our proofs in Section [4] do not depend on the starting distribution so similar 
results can be stated for k walks starting from an arbitrary probabilistic distribution. 

2 Preliminaries 

Let us begin with a quick review of asymptotic notation, like o(l), as used in this paper: f(n) = 0(g{n)) 
if there exist positive numbers c and N, such that f{n) < cg(n),\/n > N. f(n) = £l(g(n)) if there exist 
positive numbers c and N, such that f(n) > cg(n),\/n > N. f(n) = Q(g(n)) if f(n) = 0(g(n)) and 
f(n) = Q(g(n)). f(n) = o(g(n)) if lim^oo f(n)/g(n) = and f(n) = u{g{n)) if lim n _>oo f(n)/g(n) = oo. 
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Let G(V,E) be an undirected graph, with V the set of nodes and E the set of edges. Let n = \V\ and 
m = \E\. For v £ V, let N(v) = {u £ V \ (v, u) G E} be the set of neighbors of v, and let 5(v) = \N(v)\ 
be the degree of v. A ^-regular graph is a graph in which every node has degree 8. 

Let Xi = {Xi(t) : t > 0} be a simple random walk starting from node i on the state space V with 
transition matrix Q. When the walk is at node v, the probability to move in the next step to u is 
Qvu = Pr(w,n) = for (v,u) G E and otherwise. 

Let Ti(G) of a graph G be the time taken by a simple random walk starting at i to visit all nodes in G. 
Formally Tj = min{i : {JQ(1), . . . , = V} and clearly this is a stopping time and therefore a random 

variable. Let Ci = E[ri] be the expected number of steps for the simple random walk starting at i to 
visit all the nodes in G. The cover time C(G) of a graph G is defined formally as C(G) = maxjCj. The 
cover time of graphs and methods of bounding it have been extensively investigated [2E1 [31 [151 El EH IS], 
although much less is known about the variance of the cover time. Results for the cover time of specific 
graphs vary from the optimal cover time of ©(nlogn) associated with the complete graph K n to the worst 
case of 0(n 3 ) associated with the lollipop graph [T9l ITS]. 

The hitting time, h(u,v), is the expected time for a random walk starting at u to arrive to v for the 
first time. Let /i max be the maximum h(u, v) over all ordered pairs of nodes and let h m \ n to be defined 
similarly. The following theorem provides fundamental bound on the cover time C(G) in terms of /i max 
and h m i n . 

Theorem 1 (Matthews' Theorem |28j) For any graph G, 

hmin ' H n < C(G) <J ^-max ' H n 

where = \xi{k) + is the k-th harmonic number. 

Notice that this bound is not always tight, since in the line, for example, we have CiG^j — /i maj{ . 

For an integer t > 0, a graph G and its vertices u and v, let p t uv be the probability that a simple 
random walk starting from vertex u is at vertex v at time t and let tt(v) denote the probability of being 
at v under the stationary distribution of G. By mixing time t m of G, we understand the smallest integer 
t > such that for all vertices u in G, ^2 V \p l u v — vr(v)| < 1/e. 

2.1 k- Random Walks: Cover Time and Speed-up 

Let us turn our attention to the case of k parallel independent random walks. We assume that all walks 
start from the same node and we are interested in the performance of such a system. The natural extension 
to the definition of cover time is the k cover time: Let rf be the random time taken by k simple random 
walks, all starting at i at t = 0, to visit all nodes in G (i.e., the time by which each node has been visited 
by at least one of the walks). Let C\ = E[rf] be the expected cover time for k walks starting from i. For 
a graph G, let C k (G) = maxj (G) be the fc-walks' cover time. In practice, we would like to bound the 
speed-up in the expected cover time achieved by k walks: 

Definition 2 For a graph G and an integer k > 1, the speed-up, S k (G), on G, is the ratio between the 
cover time of a single random walk and the cover time of k random walks, namely, S k (G) = ^ . 

Note that speed-up on a graph is a function of k and of the graph. When k and/or graph is understood 
from the context we may not mention them explicitly. 



3 Statement of our results 

We show that k random walks can cover a graph k times faster than a single random walk on a large class 
of graphs, a class that includes many important and practical instances. We begin with a simple statement 
of linear speed-up on simple graphs, but as we broaden the class of graphs considered, our statements of 
speed-up become more involved. We begin with a linear speed-up on cliques and expanders: 
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Theorem 3 For k < n and for a graph G that is either a complete graph on n vertices or an expander 
the speed-up is S k (G) = tt(k). 

We can show a linear speed-up on other graphs, as well, but to do so we must bound k, the number of 
random walks. Which bound we use depends on Matthews' bound. 

When Matthews' bound is tight, we can prove a linear speed-up for k as large as k < logra. Our proof 
depends on a generalization of Matthews' bound for multiple random walks: C k (G) < c+ ^"^ • h max ■ H„ 
(see Theorem 1 1 3 j) . Since Matthews' bound is known to be tight for the complete graph, expanders |15j . 
d-dimensional grids for d > 2 |15j . d-regular balanced trees for d > 2 [33], Erdos-Renyi random graphs 
[16j . and random geometric graphs [9] (in the last two cases, for choice of parameters that guarantee 
connectivity with high probability) , the following result shows that k < log n random walks yield a linear 
speed-up for a large class of interesting and useful graphs: 

Theorem 4 IfC(G) = 9(/i max log n), then S k (G) = Q(k) for all k < logn. 

When Matthews' bound is not tight, we must proceed more indirectly and bound k in terms of the gap. 
Let g(n) = h be the gap between the cover time and the maximum hitting time. We find it remarkable 
that, using this gap, we can prove a nearly linear speed-up for k less than g(n) without knowing the actual 
cover time. 

Theorem 5 If g(n) = — > oo and k < 0{g 1 - e {n)) for some e < I, then C k (G) = ^ + o(^), and 
S k (G)>k-o(k). 

These results raise several interesting questions about speed-ups on graphs in general: is k an upper 
bound on the best possible speed-up, does proving a linear speed-up generally require bounding k, and 
what really characterizes the best possible speed-up? 

For the first question, we have been unable to prove that k is an upper bound on the best possible 
speed-up. We do know that a wide range of speed-ups is possible, and that sometimes the speed-up can 
be much less than k. The following result shows that the speed-up on a cycle is limited to log k. 

Theorem 6 For all k < e n / 4 , the speed-up on the cycle L n with n vertices is S k (L n ) = O(logfc). 

On the other hand, it is possible there are graphs for which the speed-up is much more than k. For 
example, the following result shows that, when the walk starts at the node in the center of the graph (we 
can't prove this is true from other nodes in the graph), the speed-up is exponential in k: 

Theorem 7 For a bar-bell graph B n on n vertices (see Section^for a definition) if v c is the center of the 
bar-bell then C Vc = 0(n 2 ) but C* = 0(n) for k = O(logn). 

For the second question, proving a linear speed-up in general does indeed require bounding k. In fact, 
the situation turns out to be rather complex, since the speed-up depends not only on the graph itself, but 
also on the relationship between the size of the graph and k. For example, using Theorem [6l we can show 
that there may be a full spectrum of speed-up behaviors even for a single graph: 

Theorem 8 Let G be a two dimensional \fn x yjn grid on the torus (for which Matthews' bound is tight). 

1. For k < logn, the speed-up is S k (G) = J7(/c) 

2. For k > log 3 n the speed-up is S k (G) = o(k). 

Finally, what property of a graph determines the speed-up? We do not have a complete answer to this 
question. We are able to relate the speed-up on a graph to the ratio between cover time of the graph and 
the maximal hitting time of the graph as seen in Theorem [5] and further also to the mixing time of the 
graph. Intuitively if a graph has a fast mixing time then the random walks spread in different parts of the 
graph and explore it essentially independently. 
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Theorem 9 Let G be a d-regular graph. If the mixing time of G is t m then for k < n the speed-up is 

s k = n{ T -^—) 

V tm In n ) 

To this end, questions regarding minimal and maximal bounds on the speed-up as a function of k remain 
open, but we do conjecture that speed-up is at most linear and at least of logarithmic order: 

Conjecture 10 For any graph G and any k > 1, S k (G) < O(k). 

Conjecture 11 For any graph G and any n > k > 1, S k (G) > Q(logk). 

4 Linear speed-up 

Linear speed-up in a clique follows from folklore, and we will show linear speed-up in an expander in 
Section 14.11 so we begin by stating this simple example from folklore for later use: 

Lemma 12 For k <n and a clique K n of size n the speed-up is S k (K n ) = k (up-to a rounding error). 

We now show a linear speed-up in a much larger class of graphs, as long as k < logn. We begin with 
Matthews' upper bound C{G) < /i max ■ H n for the cover time by a single random walk, and generalize the 
bound to show that k random walks improve Matthews' bound by a linear factor: 

Theorem 13 (Baby Matthew Theorem) If G is a graph on n vertices and k < logn, then 

C k (G) < , - - • fomax • H n . 

k 

Proof. Let the starting vertex u of the &-walk be chosen. Fix any other vertex v in the graph G. Recall, 
for any two vertices u',v' in G, h(u',v') < h m3iX . Thus by Markov inequality, Pr[a random walk of length 
e/imax starting from u does not hit v] < 1/e. Hence for any integer r > 1, the probability that a random 
walk of length er/i max does not visit v is at most l/e r . (We can view the walk as r independent trials 
to visit v.) Thus the probability that a random fc-walk of length er/i max starting from u does not visit 
v is at most l/e fcr . Set r = [(Inn + 2 In Inn)/ k~\ . Then the probability that a random A;- walk of length 
er/i max does not visit v is at most l/(n In 2 n). Thus with probability at least 1 — (1/ In 2 n) a random A;- walk 
visits all vertices of G starting from u. Together with Matthews' bound C(G) < h mauX H n , we can bound 
the /c-cover time of G by C k {G) < er/i max (l + 1/ln 2 ) + C(G)/ln 2 n < (e + o{l))h max H n /k. The theorem 
follows. □ 

When Matthews' bound is tight, we have C{G) = 0(/i max logn), and the linear speed-up is an imme- 
diate corollary of Theorem [T3l 

Theorem H If C(G) = 9(/i max logn), then S k (G) = £l(k) for all k < logn. 

When Matthews' bound is not tight, the proofs become more complex. We begin with the following 
result expressing the A;- walk cover time in terms of the single- walk cover and hitting times: 

Theorem 14 For any graph G of size n large enough and for any function f(n) £ u>(l) 

C k (G) < (1 + ■ C(G) + (3 log k + 2/(n)) • fr max . 
k 

The proof is at the end of the section. In this case, we get at least an order of linear speed-up when this 
upper bound is dominated by the left term. Choosing f(n) sufficiently small, informal calculation shows 
this happens when logfc • /i max < C/k or A;logfc < C//i max , which happens when k = (C//i max ) 1_e . Once 
again, when Matthews' bound is tight and C//i max = logn we have the following approximation to our 
previous result, which improves the linear speed-up constant from 1/e to 1 at the cost of a slight reduction 
in the choice of applicable k: 
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Corollary 15 If C = 6(/i max logn) and k = ©(log 1 e n) for some e < 1, then C k = ^ + o(^), and 
S k {G)>k-o{k). 

When Matthews' bound is not tight, we have the following result expressed directly in terms of the gap 
(l(n) = , c between the cover time and the hitting time: 

a V ' "max ° 

Theorem [5] If g(n) = ^ oo and k = 0{g l ~ e (n)) for some e < 1, then C k (G) = § + o(f ), 

and S k (G)>k-o(k). 

Proof. Set f(n) £ uj(1) in Theorem O to be log(g(n)), and the claim follows. □ 

We now prove Theorem [ID Our main technical tool conceptually different from our previous proofs is 
the following lemma (see the appendix for the proof). 

Lemma 16 Let G be a graph and ux,...,Uk be some of its vertices, not necessarily distinct. Let T c and 
p c be such that a random walk of length T c starting from u\ visits all vertices of G with probability at least 
p c . Let T/j and p^ be such that for any two vertices u and v of G, a random walk of length starting from 
u visits v with probability at least ph- Let £ > 1 be an integer. Then a random k-walk of length T c /k + £Th 
starting from vertices u\, . . . , covers G with probability at least p c (l — k(l — PhY)- 

Next, we use the following bound on the concentration of the cover time by Aldous [I]: 
Theorem 17 (|4j) For the simple random walk on G, starting at i, if Ci/h mayL — > oo then 

n/Ci ^ l. 

Equipped with the proper tools we are ready to prove Theorem 1 14i 
Proof of Theorem VTA If the conditions of Theorem [T7] do not hold then the cover time and hitting time 



are on the same order and Theorem [TH gives a trivial (not tight) upper bound. Assume the conditions of 
Theorem [T71 holds. Theorem [T71 implies that Pi[t u /C u > 1 + 5 n ] < e n where 5 n ,e n — > as the size of the 
graph goes to infinity. Thus Pr[a random walk of length (1 + o(l)) • C covers G] > 1 — o(l). By Markov 
bound, for a fixed vertex v of the graph, Pr[a random walk of length 2/i max visits vertex v] > 1/2. If we set 
t = log k+uj(l), then Lemma [TBI implies that a random /c-walk of length L = ( 1+ °^ c -\- (log fc+a;(l))2/i max 
covers G with probability at least (1 — o(l)) • (1 — k2~ e ) = (1 — o(l)) • ^1 — ^jy^) = 1 — o(l). Here each of the 
k random walks may start at a different vertex. Thus a walk of length i ■ L does not cover G with probability 
at most [o(l)] 1 so the cover time of G can be bounded by LJ2i i ' [°(1)P = L ' 1-1(1) = L ■ (1 + o(l)). □ 

4.1 Linear speed-up on expanders 

In this section we prove that for the important special case of expanders, there is a linear speed-up for k 
as large as k < n: 

Theorem 18 If G is an expander, then the speed-up S k (G) = f2(fc) for k < n. 

An (n, d, A)-graph is a d-regular graph G on n vertices so that the absolute value of every nontrivial 
eigenvalue of the adjacency matrix of G is at most A. It is well known (see [6]) that a d-regular graph on 
n vertices (with a loop in every vertex) is an expander (that is, any set X of at most half the vertices has 
at least c\X\ neighbors outside the set, where c > is bounded away from zero), if and only if there is a 
fixed A bounded away from d so that G is an (n, d, A)-graph. Since the rate of convergence of a random 
walk to a uniform distribution is determined by the spectral properties of the graph it will be convenient 
to use this equivalence and prove that random walks on (n, d, A)-graphs, where A is bounded away from 
d, achieve linear speed up. In what follows we make no attempt to optimize the absolute constants, and 
omit all floor and ceiling signs whenever these are not crucial. All logarithms are in the natural basis e 
unless otherwise specified. 
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Lemma 19 Let G be an (n, d, A)- graph. Put s = 1°^^ and b = -g^x- Then, for every two vertices u,v 
of G, the probability that a random walk of length 2s starting at u, covers v is at least 2n+4s+4bn ■ 

Proof. For each i, s < i < 2s, let Yj be the indicator random variable whose value is 1 iff the walk starting 
at u visits v at step number i. Let Y = X)i=s+i be the number of times the walk visits v during its last 
s steps. Our objective is to show that the probability that Y is positive is at least 2 n+As+ibn • ^° ^° so ' 
we estimate the expectation of Y and of Y 2 and use the fact that by Cauchy- Schwartz 

Pr[y > 0] _ y Pr[F _ i] > gfeggg = _ 1W (1) 

By linearity of expectation E(Y) = X^= s +l EOty- The expectation of Y"j is the probability the walk 
visits v at step i. This is precisely the value of the coordinate corresponding to v in the vector A l z, where 
A is the stochastic matrix of the random walk, that is the adjacency matrix of G divided by d, and z is 
the vector with 1 in the coordinate u and in every other coordinate. Writing z as a sum of the constant 
1/n- vector Z\ and a vector z 2 whose sum of coordinates is 0, and using the fact that Az\ = z\ and that 
the ^2-norm of A l z 2 satisfies 1 1 ^4* ^2 1 1 < we conclude, by the definition of s, that each coordinate of 
A l z deviates from 1/n by at most 

It thus follows that 



E(Y) > i. (2) 



By linearity of expectation 



2s 

E(Y 2 ) = E{Yi) + 2 

i=s+l s<i<j<2s 

Note that E(YiYj) is precisely the probability that the walk visits v at step i and at step j. This is the 
probability that it visits v at step i, times the conditional probability that it visits v at step j given that 
it visits it at step i. This conditional probability can be estimated as before, showing that it deviates from 
1/n by at most (X/dy~ l . It thus follows that 

2s 2 A 

E(Y 2 ) < E(Y) + 2 E^i- + ^(X/dY) < E(Y)[1 + - + 2—-]. (3) 

n n d — A 

j=s+l r>0 

Plugging the estimates d2J) an d © in JT]) we conclude that 

p r[ y>0]> <S2£ > s /' 2 "' = i . 

1 ~ E(Y)[1 + 2s/n + 2X/(d - A)] ~ 1 + 2s/n + 2b 2n + 4s + 46n 

This completes the proof. □ 

Corollary 20 Let G be an (n,d, X)-graph and define s = l°^Jx) > ^ = Suppose n > 2s, and let k 

be an integer so that ie ^ b+1 ^ n lo s w > 2s. For any two fixed vertices u and v of G, the probability that v is 
not covered by at least one of k independent random walks starting at u, each of length t = 16 ( 6 + 1 ) Tllo s n ; 
is smaller than \. 

Proof. Break each of the walks into i- sub- walks, each of length 2s. By Lemma [191 f° r each of these 
sub- walks, the probability it covers v is at least 2n +4s+4bn — 4(6+i)n • ^°^ e that this estimate holds for 
each specific sub-walk, even after we expose all previous sub-walks, as given this information it is still a 
random walk of length 2s starting at some vertex of G, and this initial vertex is known once the previous 
sub-walks are exposed. It follows that the probability that v is not covered is at most 

/ 1 S \kt/2s , -kt/(8(b+l)n (Q -21ogn J_ 

{ 4(6 + l)n J n 2 ' 
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as needed. 



□ 



In the notation of the above corollary, the k random walks of length t starting at u cover the whole expander 
with probability at least 1 — 1/n. Since the usual cover time of the expander is 0(n log n) it follows that 
the expected length of the walks until they cover the graph does not exceed t + ^0(n log n) < 0(t). 

Note that for every fixed b, the total length of all k walks in the last corollary is 0(n log n), and that 
the assumption 16 ( fe+1 ^ TO logTO > 2s = 2 i°^J^ holds for every k which does not exceed b'n for some absolute 
constant b' depending only on b (as d/X = 1 + 1/6). This shows that k random walks on n- vertex expanders 
achieve speed-up U(k) for all k < n. 

5 Speed-up and Mixing Time 

Random walks on expanders converge rapidly to the stationary distribution. For graphs with fast mixing 
times, like expanders, the following theorem gives a second bound on the speed-up in terms of mixing time. 

Theorem [9] Let G be a d-regular graph. If the mixing time of G is t m then for k < n the speed-up 
is S k = ftfrnM 

\t m Inn' 

Proof. Let G be a (i-regular graph of size n. We show that the expected cover time of G by a ran- 
dom A:- walk is 0( tmn jf n ). As a cover time of any graph is at least nlnn the theorem follows. 

In this proof we represent a random fc-walk on G by an infinite sequence of random variables Xq , X\ , . . . , 
where Xi is the position of the 1 + (imodfc)-th token at step \i/k\ + 1. Define the random variables 
Yi = X v 

i/k\ k-6tm inn+(i mod k) ■ Hence, Y^s correspond to the position of the /c-walk after every 6i m lnn 
steps. Let a random variable Y( be Yi conditioned on a specific outcome of Yq, . . . , Yj_fc. Since t m is the 
mixing time of G and the stationary distribution of a random walk on G is uniform (G is (i-regular), the 
statistical distance of Y- from the uniform distribution on G is at most (l/e) 61nn < 1/n 6 . In particular, 
for any vertex v of G, |Pr[l^' = v] — l/n| < 1/n 6 . 

Thus, for any 1 < £ < n 3 and any sequence v\, . . . ,V£ of vertices 

(1/n - /n 6 Y < Prfy/K,' ■ • • Y/ = v x ■ ■ ■ v e ] < (1/n + 1/n 6 / 

Hence, 

1/n 1 ■ (1 - 1/n 2 ) < Vi[Y[Y2 ■ ■ ■ Y' t = v x ■ ■ ■ v t ] < 1/n 1 ■ (1 + 2/n 2 ). 

One can easily show (see the proof of Theorem [26]) that the probability that a clique of size n is 
not covered within lOnlnn steps by a random 1-walk is at most 1/n 9 . By the above bound distribution 
of Y[Y% ■ ■ -YI, for 1 < £ < n 3 is close to a distribution of a random walk on a clique. Hence, unless 
Y{, Y^, ■ ■ ■ , Y[ 0nlnn does not hit all the vertices of G, we can bound the expected cover time of G by 
(6i m Inn) • C k {K n ) ■ (1 + 2/n 2 ). If Y[, Y 2 ', . . . , Y{ 0nlnn does not hit all the vertices of G we can bound the 
cover time of G by the trivial bound 0(n 3 ). Since C k (K n ) = 0(nlnn/k) the claim follows. □ 

6 Logarithmic speed-up 

So far we have seen only cases where the speed-up in cover time achieved by multiple random walks is 
considerable, i.e., at least linear. In this section we show that this is not always the case and that the 
speed-up may be as low as logarithmic in k. The cover time of a cycle L n on n vertices is 0(n 2 ). We prove 
the following claim. 

Theorem[6] For any integer n and k < e n//4 , the speed-up on the cycle with n vertices is S k (L n ) = ©(log A;). 

Hence for a cycle even a moderate speed-up of w(logn) requires super-polynomially many walks, and 
to achieve speed-up of n e one requires 2^ n ' walks. The theorem follows from the following two lemmas. 
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Lemma 21 Let s > 1 and k > 1 be such that C k < n 2 /s for a cycle of length n. Then k > e s / 16 /8. 

Proof. Assume that C k < n 2 /s and we will prove that k > e s / 16 /8. Pick an arbitrary vertex v of the 
graph. Clearly, the cover time starting from the vertex v is C k < n 2 /s. Let a random variable T v be the 
cover time of a random fc-walk starting from v. By Markov inequality, Pr[T„ > 2n 2 /s] < 1/2. Hence, with 
probability at least 1/2 one of the k walks reaches the vertex v n /2 that is at distance n/2 from v in at 
most 2n 2 /s steps. For a single walk, if it reaches v n /2 starting from v in time at most 2n 2 /s, then there is 
1 < t < 2n 2 /s so that the number of its steps to the right until time t differs from the number of its steps 
to the left by at least n/2. Given that this happens, with probability 1/2 the number of steps to the right 
will differ from the number of steps to the left by at least n/2 also at time 2n 2 / s. This is because after time 
t we will increase the difference with the same probability as that we will decrease it since the probability 
of going to the left is the same as the probability of going to the right. By Chernoff bound, Pr[the number 

of steps to the left and to the right of a walk differs by at least n/2 at time 2n 2 /s] < 2e ien 2 < 2e~ s / 16 . 
Hence, the probability that a particular walk reaches the vertex v n /2 during 2n 2 /s steps is at most 4e~ s / 16 . 

Thus, Prfthere exists a walk that reaches v n M in time at most 2n 2 /s] < 4&>e~ ,s / 16 . Since this probability 
must be at least 1/2 we conclude that < k. □ 

Lemma 22 Let k be large enough and n be an integer. If k < e™/ 4 then C k < 2n 2 /\uk for a cycle of 
length n. 

To prove this lemma we need the following folklore statement (see the appendix for the proof). 
Proposition 23 Let c > 2 be a constant. For every even integer n > 16c 2 , 

e -3c 2 -4 < pr ^ c _ < X -n/2< cVn] < e - 2(c - 1)2 , 

where X is a sum of n independent 0-1 random variables that are 1 with probability 1/2. 

2n 2 



Proof of LemmalEE To prove that C k < let c = vhifc/2 and £ = n 2 /4(c - 1) . If a single walk 
during a random /c-walk of length £ on a cycle of length n makes in total at least £/2 + n/2 steps to the 
right then it traversed around the whole cycle. Note, n/2 = VI(c — 1). By the previous proposition, Pr[a 
single walk makes at least £/2 + n/2 steps to the right during a random walk of length £] > e~ 3c ~ 4 > 1/k, 
for k large enough. Hence, k walks walking in parallel at random for £ steps fail to cover the whole cycle 
of length n with probability at most (1 - l/k) k < 1/e. Thus C k < = e ^/( e _ ^ ^ 2n 2 /ln/c, for 

k large enough. □ 

Lemma [21] also implies the following claim. 

Theorem 24 Let G n ^ be a d-dimensional grid (torus) on n l l d x n 1 ^ x • • • n l l d vertices, d > 2. For any 
k, C k (G n4 ) >fl(n 2 / d '/ log k). 

Proof. We prove the claim for d = 2. The other cases are analogous. Consider the random A:-walk on a 
y/n x yfn grid (torus). We can project the position of each of the k walks to the x axis. This will give a 
distribution identical to a A:-walk on a cycle of size yfn where in each step we make a step to the left with 
probability 1/4, step to the right with probability 1/4 and with the remaining probability 1/2 we stay at 
the current vertex. In order for a /c-walk to cover the whole grid, this projected walk must cover the whole 
cycle. Thus the expected cover time of the grid must be lower-bounded by the expected cover time for a 
cycle of size y/n which is Q(n/logk) by Lemma [2TJ (Note the steps in which we stay at the same vertex 
can only increase the cover time.) □ 

Corollary 25 For a 2-dimensional grid G n ^,, S k {G n ^) < O (log 2 n log k). 
This corollary together with Theorem [J] implies Theorem [5J 
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Figure 1: Example barbell graph -B13, v c is the center of the bar-ball 

7 Exponential speed-up 

On some graphs the speed-up can be exponential in k for at least some choice of the starting point. For 
an odd integer n > 1, we define a barbell graph B n to be a graph consisting of two cliques of size (n — l)/2 
connected by a path of length 2 (see Figure [1]) . The vertex on that paths is called the center of B n and the 
cliques are called bells. The expected time to cover B n by a random walk is ©(re 2 ) since once the token is 
in one of the cliques it takes on average 0(n 2 ) steps to exit that clique. It can be shown that the maximum 
cover time is attained by starting the random walk from the center of B n . We show the following theorem 
(see the appendix for the proof). 

Theorem 26 Let n > 1 be an odd integer, v c be the center of B n and k = 20 Inn. The expected cover 
time starting from v c satisfies C„ = 0(n). 

Hence, the speed-up in a cover time starting from a particular vertex of a A;-random walk compared 
to a random walk by a single token may be substantially larger than k. In the case of B n the speed-up is 
Q(n) for 0(logn)-walks for walks starting at a particular vertex. 

8 Conclusions and Open Problems 

In this paper, we have shown that many random walks can be faster than one, sometimes much faster. 
Our main result is that a linear speed-up is possible on a large class of interesting graphs — including 
complete graphs, expanders, grids, hypercubes, balanced trees, and random graphs — in the sense that 
k < log n random walks can cover an n-node graph k times faster than a single random walk. In the case 
of expanders, we obtain a linear speed-up even when k is as large as n. Our technique is to relate the 
expected cover time for k random walks to the expected cover and hitting times for a single random walk; 
and to observe that if there is a large gap between the single-walk cover and hitting times, then a linear 
speed-up is possible using multiple random walks. Using a different technique, we were able to bound the 
A;-walk cover time in terms of the mixing time as well. 

Open problems abound, despite of the progress reported here. There are the standard questions 
concerning improving bounds. Is it possible that the speed-up is always at most k? Our single counter 
example was that multiple random walks starting at the center of the barbell achieved an exponential 
speed-up, but perhaps the speed-up is limited to k if we start at other nodes. Is it possible that the 
speed-up is always at least log kP. We have shown that the speed-up is log k on the ring, and we conjecture 
this is possible on any graph. 

Another source of open problems is to consider more general classes of graphs. Said in another way, 
our approach has been to relate the /c-walk cover time to the single-walk hitting time and mixing time, 
but is there another property of a graph that more crisply characterizes the speed-up achieved by multiple 
random walks? 
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A Proofs 



Proof of Lemma[TR The proof is conceptually simple. We introduce here a little bit of notation to describe 
it formally. For a sequence of vertices c = (co, c%, . . . , ct) and a random walk X on G starting from cq, c C X 
denotes the event Ai=o -^X*) = c »- p or two sequences c = (co, . . . , q) and cZ = (do, ■ ■ ■ ,d t '), where q = do we 
denote by cod = (co, . . . , Cf, d\, ■ ■ ■ , df). It is straightforward to verify, if X is a random walk starting from cq 
and Y is an independent random walk starting from do, then Pr[cC X & d Q Y] = Pr[coe? C X]. Last, for 
an integer m > 1 and a sequence c = (co, ci, . . . , Ck m -i), 4,i denotes the subsequence (c(j_i) m , . . . , Q m _i) 
for < i < k. 

WLOG T c is divisible by k. Clearly, the probability that a random /c-walk (Xi,...,Xfc) of length 
T c /k + ^TX on G starting from vertices ui, ■ ■ ■ , Uf. covers all of G can be lower-bounded by 



Pr 



V 



4,1 E Xl & h 2 O 4,2 E ^2 & • • • fyfc ° 4,fc E ^4 



where c is taken from the set of all sequences of vertices from G corresponding to walks of length T c on 
G that start in u\ and cover whole G, and h{ is taken from the set of all sequences of vertices from G 
corresponding to walks of length at most IT^ that start in and hit c^_i^ c /k f° r the first time only at 
their end. It is easy to verify that all the events in the union are disjoint. Hence, 



P 



c,h 2 ,...,h k 



Pr 



4,i E X x & h 2 o 4,2 E X 2 & ■ • • fyfc 4,fe E ^4 



• • • hh C Xt 



= £ Pr 

c,h,2,—,h k 

= J2 Pr ^ E Xl l • Pr ^ 2 E ^2] • • ■ Pr[^ C X fe ] 
c,h2,—,hk 

= ^ Pr[c □ Xx] • Pr[K 2 C X 2 ] • • • £ Pr[/7 fc C X fe ], 
c fa ftfe 

where the third equality follows from the independence of the walks. By our assumption Xc Pr [^!= X\} > 
p c . Since (1 — a)(l — b) > (1 — a — b) for a, b < 1, to conclude the lemma it suffices to argue that 
Xk. Pr [^ E > 1 - (1 - p h ) e for all i. Notice that Pr t^ E Xi] = Pr[ a random walk of length lT h 



starting from m visits c^_i^ c /^]. Since a random walk of length T/j fails to visit c^_i^x c /k with probability 
at most 1 — ph regardless of its starting vertex, a random walk of length ITh fails to visit c^_i^x c /k with 
probability at most (1 — Ph) ■ The lemma follows. □ 



Proof of Proposition \23[ The upper bound follows from Chernoff bound. The lower bound can be 
derived as follows. Pr[(c — < X — n/2 < Cypn\ = Y^^-{c-\)y/n^ Y ^ ~ = Por an ^ ^' 

Pr[X — n/2 = fc] = ( n /2+fc) /2 n - We will compare ( n /2+fc) with the central binomial coefficient ( n " 2 )- 

(n/2) _ n n/2 (w ~ j + 1) n/2+Cy/E j 

(n/2 + V) " i . ' + 

j=n/2+l ( n _ j + 1) 

J=1 (i-10 + i))' 

We upper-bound this ratio as follows: 

n£?(i + -j) < 

J n 
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2_ Cy/n,{cy/n + l) 

= e « " 2 
< e c2+1 . 

Now, for < x < 1/2, e~ 2x < 1 — x. Hence, 

n^ ( i_l (i + i)) > e -SE^0-+i) 
J n 

_4 (cy / H:+l)(eyH:+2) 

> e «' 2 

> e' 2c2 - 2 . 

Thus 



(n/2) 



Using estimates on Stirling's formula f > a/- 2 — •2™, we conclude that 

O \Ti. ZJ \/ P7T77. " 



(n/2+c^n) 
\n/2) — y e-7rn 



< e 3c2+3 . 



n \ In 



\n/2 + k) ~ \n/2J y 



k=(c— l)\/n 

The lemma follows. □ 



Proof of Theorem \2b\ With high probability none of the following three events happens: 
£1 In one of the bells there are less than 4 Inn tokens after the first step. 

£2 During the first lOn steps of the random /c-walk at least 2 In n vertices return to the center. 

£3 One of the bells is not covered within the first lOn steps. 

If none of the above events happens then each of the bells is explored by at least 2 Inn tokens. Two 
disjoint cliques of size m = (^—1) /2 are each covered by a random 2 In n-walk in expected time 2C 2lnn (K m ), 
by Lemma [T2l So if C is the expected cover time of B n by a random 1-walk then: 

C fc < 2C 2lnn (K m ) + Pr[(l)]C + Pr[£2 U £3](10n + C). 

We need to estimate the probabilities of the above events. By Chernoff bound, 

Pr[£l] < 2e-( 161nn ) 2 / 2 - 201nn < 1/n 5 

for n large enough. A single token returns to the center of B n within lOn steps with probability at 
most - H / 0n , r < — . The probability that at least 2 Inn vertices return to the center is then < 

n m(m+l) m ^ j 

2 201nn -(22/m) 21nn < 1/n 5 , for n large enough. Finally, the probability that a random 2 In n-walk does not 
cover a clique of size m in lOn steps is at most m(l — ^) 20riln?1 < me~ 101nn < 1/n 5 . Now since C = 0(n 2 ) 
and C 21nn {K m ) = O(n), we get C* c = O(n). ' □ 

Proof of Lemma In the lemma we restrict k to be less than n to avoid rounding problems and for 
simplicity we also assume self loops in the clique. We will prove this using a coupon collector argument. 
Let C be the number of purchases needed to collect n different coupons. Consider the case where a fair 
mom decides to help her k kids to collect the coupons. Each time she buys a cereal and gets a coupon she 
gives it to the next-in-turn son in a round-robin fashion (i.e. kid i mod k gets the coupon from step i). 
Clearly, in expectation, after C visits to the grocery store mom got all the different coupons. Note that 
each child had his own independent coupon collecting process, and each have the same number of coupons 
(plus-minus one). □ 
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